Binaural cue coding-Part II: Schemes and applications
Abstract
Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and side information. The companion paper (Part I) covers the psychoacoustic fundamentals of this method and outlines principles for the design of BCC schemes. The BCC analysis and synthesis methods of Part I are motivated and presented in the framework of stereophonic audio coding. This paper, Part II, generalizes the basic BCC schemes presented in Part I. It includes BCC for multichannel signals and employs an enhanced set of perceptual spatial cues for BCC synthesis. A scheme for multichannel audio coding is presented. Moreover, a modified scheme is derived that allows flexible rendering of the spatial image at the receiver, supporting dynamic control. All aspects of complete BCC encoder and decoder implementations are discussed, such as down-mixing of the input signals, low-complexity estimation of the spatial cues, and quantization and coding of the side information. Application examples are given, and the performance of the coder implementations is evaluated and discussed based on subjective listening test results.
Related articles
Binaural cue coding-Part I: psychoacoustic fundamentals and design principles
Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and BCC side information. The BCC side information has a low data rate and it is derived from the multichannel encoder input signal. A natural application of BCC is multichannel audio data rate reduction since only a single down-mixed audio channel needs to be transmitted. An alternati...
Coding of Spatial Audio Compatible with Different Playback Formats
Recently, various schemes were proposed for parametric coding of stereo and multi-channel audio signals. Binaural Cue Coding (BCC) is such a technique. It represents multi-channel audio signals as a single downmixed channel plus a small amount of side information. BCC can be applied to mono and stereo backwards compatible coding of multi-channel audio signals. In this paper, we propose a genera...
Parametric Coding of Spatial Audio
Recently, there has been a renewed interest in techniques for coding of stereo and multi-channel audio signals. Stereo and multichannel audio signals evoke an auditory spatial image in a listener. Thus, in addition to pure redundancy reduction, a receiver model which considers properties of spatial hearing may be used for reducing the bitrate. This has been done in previous techniques by consid...
Härmä and Faller Spatial Decomposition
Techniques where a stereo or a multichannel signal is decomposed into spatial source-labeled time-frequency slots by level, time-difference, and coherence metrics have become popular in recent years. Good examples are binaural cue coding and up/downmixing techniques. In the article, we will provide an overview and discuss parallel approaches in the field of array processing and blind source sep...
Robustness analysis for multi-channel hearing aid algorithms with binaural output by means of objective perceptual quality measures
Introduction. According to the ITU-T P.835 recommendation, subjective quality evaluation of noise reduction schemes involves (i) the perceived quality of the speech signal, (ii) the quality of the background signal and (iii) the overall quality. In [7] it has been shown that these subjective measures are predictable by objective measures in the case of monaural noise reduction schemes. In this s...
Journal:
- IEEE Trans. Speech and Audio Processing
Volume: 11
Publication date: 2003